REMARKS 

Telephone Interview 

Applicants would like to express their appreciation to Examiner Chemyshev and Examiner 
Eyler for the courtesy extended to Applicants* agent, Angela Dallas Sebor, during the telephone 
interview on October 2, 2003. During the interview, the outstanding issues under 35 U.S.C. § 112, 
first paragraph were discussed. Examiner Eyler indicated that the Examiners would be v^thdrav^ng 
finality and reopening prosecution to reconsider issues under 35 U.S.C. § 1 12, first paragraph, on 
the basis of written description. During the interview, possible claim amendments were discussed 
that may place the claims in a condition for allowance or at least reduce issues upon the reopening 
of prosecution. In particular, the Examiner suggested that structural and more specific ftmctional 
language be provided in the claims to meet the requirements for written description. Also, 
Applicants' agent indicated that a reference would be provided to demonstrate that one of skill in the 
art at the time of the invention would possess sufficient knowledge regarding the structure of FGF-2 
to understand where modifications could be made to the protein without destroying functional 
activity of the protein. The Examiners indicated that any amendments presented in response to the 
final Office Action would be entered and considered, given the proposal to reopen prosecution. 

Claim Amendments : 

The claims have been amended in a manner discussed by the Examiner. Specifically, the 
independent Claim 1 has been amended to recite specific structural and functional limitations for the 
FGF-2 protein portion of the chimera, as well as for the penetratin peptide portion of the chimera. 
All of the amendments are supported in the original specification and claims as filed. The 
limitations of Claims 4 and 9 and from the specification on page 1 7, lines 11-16 have been imported 
into Claim 1 and Claims 4 and 9 have been cancelled, without prejudice to or disclaimer of the 
subject matter therein. Support for the more specific functional language regarding FGF-2 is found 
in the specification on page 13, lines 1 1-27. 

The structure-function relationship of FGF-2 proteins as knovra in the art and described in 
the specification is discussed in detail below under the response to the rejection under 35 U.S.C. § 
112, first paragraph. With regard to the penetratin peptide portion of the chimeric protein, 
Applicants submit that the specification and the art at the time of the invention provide a detailed 
description of the structure and function of the peptide, such that one of skill in the art would be able 
to recognize that the inventors were in possession of the invention as claimed at the time of filing. 
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Specifically, Applicants refer to the specification at page 10, lines 3-14; page 20, line 9 to page 23, 
line 14, where penetratins are discussed both structurally and functionally in detail, including 
multiple examples of penetratin sequences useful in the invention from a variety of sources, and 
including reference to several publications which further describe such sequences. The motif 
described on page 21 and recited in the claims describes for the skilled practitioner what structural 
features are required of a penetratin peptide and the specification further teaches the function of the 
peptide and how to ascertain whether a peptide has such function (e.g., see Example 3). 

Objection to the Specification and Rejection of Claim 4 Under 35 U.S.C. S 1 12. First Paragraph: 
The Examiner has objected to the specification and rejected Claim 4 under 35 U.S.C. § 1 1 2, 
first paragraph, on the basis of written description. Specifically, the Examiner contends that the 
specification fails to describe the entire genus of proteins encompassed by the claim, which recites 
proteins that are at least 70% identical to the reference FGF-2 sequence. The Examiner also asserts 
that the claims recites a chimeric FGF protein that is 70% identical to an FGF-2 sequence, but fails 
to indicate other relevant identifying characteristics, such as physical and/or chemical and/or 
functional characteristics coupled with a knovm or disclosed correlation between structure and 
function. 

Applicants traverse the rejection of Claim 4 under 35 U.S.C. § 1 1 2, first paragraph. Initially, 
it is noted that Claim 4 has been cancelled and therefore, the rejection of Claim 4 is moot. However, 
the limitations of Claim 4 have been added to Claim 1 and so the rejection will be addressed with 
regard to Claim 1 and as discussed in the October 2 telephone interview. 

First, Applicants wish to clarify that, contrary to how the Examiner describes Claim 4, the 
claim does not reference the entire FGF chimeric protein with regard to the percent identity. Rather, 
the claim recites the portion of the chimeric protein that has FGF-2 activity with regard to the percent 
identity. 

Second, it is submitted that the specification, combined with the knowledge in the art at the 
time of the invention, provides sufficient guidance regarding the structure and function of FGF-2 
proteins that one of skill in the art would readily be able to identify and produce homologues of FGF- 
2 proteins that have the recited identity to the reference sequence and retain FGF-2 biological 
activity, such that it is clear that the inventors were in possession of the claimed invention at the time 
of filing. First, with regard to the function of FGF proteins, the specification provides a detailed 
discussion of the biological activities of FGF proteins, including references to a variety of specific 



9 



art-recognized assays for measuring such activity (e.g., see page 2, line 3 to page 3, line 20; and page 
13). In addition, the specification describes two specific FGF-2 sequences (SEQ ID N0:5 (bovine) 
and SEQ ID N0:6 (human), teaches that numerous FGF proteins are known in the art and share 
similar requirements for FGF receptor binding and heparan sulfate dependence, for example, and 
teaches methods for modifying FGF-2 proteins and for determining percent identity (e.g., page 1 5, 
lines 14-27; page 17, line 17 through page 20, Hne 8). 

Moreover, as discussed in the October 2 interview, the structure of FGF-2 proteins fi-om 
many animal species was well known in the art at the time of the invention and significant 
knowledge was available regarding which amino acid residues are conserved among species and 
which amino acids contribute to the biological activities of FGF-2 (i.e., a structure-to-fimction 
relationship), such that one of skill in the art would readily be able to produce a modified FGF-2 
protein that retains biological activity. 

In support of Applicants' position, enclosed herewith are two publications which describe 
many details of the structure of FGF-2 which are correlated with the protein fiinction and which were 
known in the art at the time of the invention. First, referring to the enclosed review article by Omitz 
and Itoh {Genome Biology 2(3):3005. 1-3005. 12, 2001), this review shows that FGF proteins "have 
been intensely studied for nearly 30 years" (page 6, col. 2). The review makes reference to studies 
on the characteristic structure correlated with fiinction of FGF proteins, including specifically FGF-2 
(page 2-3; and Fig. 3), and it is noted that much of this information, including the three dimensional 
structure of FGF-2 was known prior to the present invention. The Omitz reference cites Coulier et 
al., which is the second enclosed reference. 

More specifically, Coulier et al. (J. Mol. Evol. 44:43-56, 1997) illustrates in great detail the 
conserved stincture of FGF proteins, and specifically indicates what amino acids in tiie structure are 
correlated with fimction. For example, Coulier et al. provide an amino acid alignment of sequences 
from 40 different FGF species, including eight FGF-2 species (see Fig. 1), and describe/illustrate 
which amino acids are absolutely conserved among all FGF proteins, which amino acids are 
absolutely conserved among all the FGF-2 proteins, which amino acids are similar between all FGF 
or FGF-2 proteins, and finally, provide a discussion of which regions of the protein correlate to 
specific FGF fimctions (see page 45 through page 47). This reference shows that 10% of residues 
(12 residues of the 120 amino acid core sequence) are conserved among all FGF proteins, with 
highly conserved positions (e.g., positions 79-89) and regions with limited similarity (e.g., positions 
97-1 14 or 138-146). Coulier et al. mention important amino acids that have been correlated with 
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function, including two cysteine residues, the receptor binding site, and the heparan sulfate binding 
site, as well as important residues for saccharide binding. It is further noted that among FGF-2 
species, Table 1 shows that the eight amino acid sequences for FGF-2 have differences that produce 
identities across species between 84% and 98%, and it is submitted that one of skill in the art would 
readily be able to make additional modifications to the sequences given the clear structure-to- 
function correlation established by just this reference and predictably produce a functional synthetic 
variant of the human or bovine FGF-2 sequences, which are the recited sequences. Therefore, it is 
submitted that Applicants are entitled to the breadth of claims as recited, since the specification, in 
combination with the knowledge in the art, provide significant detail regarding the structure-to- 
function relationship for FGF-2. 

In view of the foregoing remarks, Applicants respectfully request that the Examiner withdraw 
the rejection of Claim 4 under 35 U.S.C. § 1 12, first paragraph. 

Applicants have attempted to address all of the issues as set forth in the July 1 0 Office Action 
and as discussed in the October 2 telephone interview. In the event that the Examiner has any 
additional questions or concerns regarding the present claims, she is encouraged to contact the 
below-named agent at (303) 863-9700. 



Respectfully submitted. 



SHERIDAN ROSS P.C. 




Angela DfiUas Sebor 
Registration No. 42,460 
1560 Broadway, Suite 1200 
Denver, CO 80202-5141 
(303) 863-9700 




11 



J Mol Evol (1997) 44:43-56 



jouuuLOfMOLKULAR 
lEVOUJTIDN 



Sprii^-Veriag New Yort: Inc. 1997 



Of Worms and Men: An Evolutionary Perspective on the Fibroblast Growth 
Factor (FGF) and FGF Receptor Families 

Francois Coulier,* Pierre Pontarotti/ Reg^e Roubin/ He^ Hartang,^ MitcheU Goldfarb,^ 
Daniel Bimbaum^ 

' Laboratoire d'Qnoologie Moleculaire, U.119 INSERM, 27 Bd Let Roure, 13009 Marseille, France 

^ Brookdale Center for Molecular Biology, Mount Sinai School of Medicine, 1 Gustave U Levy Place, New Yoik, NY 10029-6579, USA 
Received: 6 April 1996 / Acc^ted: 5 July 1996 



Abstract FGFs (fibroblast growth factors) pl^ major 
roles in a number of developmental processes. Recent 
Studies of several human disorders, and concurrent 
analysis of gene knock-out and properties of the cone- 
sponding recombinant proteim have shown that FGFs 
and their receptors are prominently involved in the de- 
velopment of the skeletal system in mammals. We have 
compared the sequences of the nix^ known mammalian 
FGFs, FGFs from other vertebrates, and three additional 
sequences that we extracted from existing databases: two 
human FGF sequences that we tentatively designated 
FGFIO and FGFll, and an FGF sequence from 
Ceenorhabditis elegans. Similaxiy, we have compared the 
sequences of the four FGF receptor paralogs found in 
chordates with four non-chordate FGF receptors, includ- 
ing one recently identified in C. elegans. The comparison 
of FGF and FGF receptor sequences in vertebrates and 
nonvertebrates shows that the FGF and FGF receptor 
families have evolved through phases of gene dupUca- 
tions, one of which may have coincided with the emer- 
gence of vertebrates, in relation with their new system of 
body scaffold. 

Key words: FGF — FGF receptor — Phyloger^^ — 
Vertebrate — Invertebrate — Gene duplication 
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Introduction 

A man may fish with the worm that hath eat of a king, 
and eat of the fish that hath fed of that worm. 

Shakespeare, Hamlet 

Comparative genomic and developmental analyses 
may provide clues for understanding the origin of gerKS 
as well as help in linking macro-evolutionary morpho- 
logical transformations to roodifications in embryonic 
patterns of e?q)ression of specialized genes (Sordino et aL 
1995; Coates 1995). We compiled available data on the 
FGF (historically, fibroblast growth factors) and FGF 
receptor (FGFR) gene families in various organisms, and 
we review these data in an evolutionary context. In mam- 
mals, the FGF famify oirrentfy con^)rises 11 members 
which interact with membrane-associated tyrosine kinase 
receptors (FGFRl to FGFR4), and with heparan-sulfate 
proteoglycans. FGFs/FGFRs interactions play major 
roles in various developmental processes involving for- 
mation of mesoderm during gastrulation, integration of 
growth, budding and patterning during early postimplan- 
tation, and development of various tissues such as ear, 
limb, hair, and the skeletal system (Johnson and Wil- 
liams 1993; Mason 1994; WiUde el al. 1995; Yanmguchi 
and Rossant 1995). These roles have been estabUshed 
through study of patterns of e?q)ression, gene invahda- 
tions, physiological e>q)eriii£nts using application of 
beads soaked with recombinant proteins, and analysis of 
human hereditary skeletal disorders (Johnson et al. 1994; 
Niswander et al. 1994; Muenke and Schell 1995; Tanaka 
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and Gann 1995; Tickle 1995; Yamaguchi and Rossant 
1995). Two new human FGF genes, and for the first time 
an invertebrate (C eiegans) sequence that has the poten- 
tial to code for an FGF-related molecule, have been iden- 
tified in databases (Wilson et al. 1994; Hodgkin et aL 
1995), confirmed by more extensive DNA sequencing, 
and added to the existing list of FGF members. Recently, 
a C eiegans sequence encoding an FGF recq)tor was 
reported (DeVore et al. 1995). The discoveiy of FGF and 
FGF receptor-encoding genes in worms allows for 
speculation both on the role of such factor-receplor in- 
teractions in nonvertebrates and on the evolution of the 
families. 

In line with the proposed function of FGF/FGFR in- 
teractions in the development of the skeletal system, we 
suggest that an ingwrtant increase in the number of FGF 
genes might be associated with tte period of macro- 
evolutionary change that coincided with the origin of 
vertdjrates and might have thus provided information in 
the making of the skeletal system. 

Methods 

Sequences. Protein sequences were obtained directly from EMBL, 
NCBl (Genbank and dbEST), or Swissprol databases. When needed, 
nucleotide sequences (also obtained from databases) were tnuislated 
using the PCGene (Intelligenetics) package. Additional sequencing was 
done by automated methods using Applied Biosystem 3 73 A instru- 
ment. 

Species abbreviations are as follows: bt: Bos taurus (bovine); cc: 
Coturrdx cotiartix (quail); ce: Caenorhabcbtis el^arts (nematode); dm; 
DrosopMla meianogaster (firuit fly); dn Damo rerio (zebrafishX gd: 
Gallus domesticus (chicken); hs: Homo sapiens (human); ma; Meso- 
cricetus auratus (golden hamster); md; monodelphis domestica (short- 
tailed opossum); mm: Mus musculus (mouse); nv: Notophtalmus viri- 
descens (newt); oa; (his caries (sheepX ol; Oryzxas latipes (medaka 
fish); pw: Pleurodeles wdLtl (Iberian ribbed newtX m: Rattus norvegt- 
cus (rat); sp: Stroxyglocentrotus purpuratus (purple urchin); ss; Sus 
scrcfa (pig); xl: Xenapus laevis (African clawed frog). 

Accession numbers for FGF and FGFR sequences are as follows: 
bt-FGFl: P0396^, bt-FGF2: P03969; bt-FGF4: P48803; cc-FGFR4: 
X76885; ce-FGF: U00048,; ce-FGFR: U39761; dm-FGFRa; Q07407; 
dm-FGFRb: Q09147; dr-FGF3: P48802; dr-FGFR4: U23839; gd- 
FGFl: A60130; gd-FGF2: P48800; gd-FGF3: P48801; gd-FGF4; 
P48804; gd-FGFRl; M24637; gd-FGFR2; M35196; gd-FGFR3; 
M35195; hs-FGFl: P05230; hs-FGF2: P09038; hs-FGF3: PU487; hs- 
FGF4: P0X62O; hs-FGF5: P12034; hs-FGF6: P10767; bs-FGF7: 
P21781; hs-FGF8: g999172; hs-FGF9: P31371; hs-FGFlO: Z70275, 
T27215, H15590; hs-FGFl 1: Z70276, H19128, H62672, R58169, 
H2881 1; hs-FGFRl: PI 1362; hs-FGFR2; P21802; hs-FGFR3; P22607; 
hs-FGFR4: P22455; hs-ILlb: P01584; hs-ILlR: P14778; hs-SRC: 
P12931; ma-FGFl: P34004; md-FGF2; P48798; mmTFGF2: P15655; 
mm-FGF4: P11403; mm-FGF5: P15656; mm-FGF6; P21658; mm- 
FGF7: P36363; mm-FGF8: P37237; nv-FGFR2; L19870; oa.FGF2: 
P20003; oa-FGF7:P48808; ol-FGFRl: D13550; ol-FGFR2: D13551; 
01-FGFR3: D13552; ol-FGFR4: D13553; pw-FGFRl: X5938a, pw- 
FGFR2:X74332; pw-FGFR3: X75603; pw-FGFR4: X65059; m-FGFl: 
P10935; m-FGF2: P13109; m-FGF5: P48807; m-FGF7: Q0219^ m- 
FGF9: P36364; sp-FGFR: U 17164; ss-FGFl: JH0476; xl-FGF2: 
P12226; X1-FGF3: P36386; xl-FGF4-I: P48805; xl-FGF4-n: P48806; 
xl-FGFRl: M61687; xl-FGFR2: X65943; 3d-FGFR4: D31761. 

The hs-FGFlO sequence was obtained by assembUng EMBL data- 



base seqiences HI 5 590 and T27215, along with extensions and cor- 
rections from additional sequencing of the corresponding clones, 
ym27b06 and MT0120, respectively. 

The hs-FGFl 1 sequence was obtained by assembling EMBL data- 
base sequences H19128, H62672, R58169, and H28811, along with 
extensions and corrections from additional sequencing of clone 
ym44el2 corresponding to H19128. 

The hs-FGFlO and hs-FGFl 1 core sequences have been submitted 
to the EMBL database and assigned accession numbers Z70275 and 
Z70276, respectively- 

Sequence AUgnmeTiL All protein sequences were aligned using the 
Chistal W program (Tliompsoo et aL 1994). 

FGF "core" sequences corresponding to hs-FGFl amino acids 
28-151, after removing internal insertions in FGFl (corresponding to 
amino acids 120 and 121 of hs-FGFl X FGF3 (corresponding to amino 
acids 137-152 of hs-FGF3X FGF5 (conesponding to amino adds 181- 
186 of hs-FGF5), FGF7 (corresponding to amino acids 159-162 of 
hs-FGF7), FGF9 (corresponding to amino acids 156-161 of hs-FGF9), 
hs-FGFlO (amino adds 92-97 of the putative partial sequence), hs- 
FGFl 1 (amino adds 92-97 of the putative partial sequence), and ce- 
FGF (amino adds 92-95 of the putative sequence)^, were used for the 
aUgrmient. 

FGFR extracellular domain sequences (acidic box, Ig loops H and 
HI) corresponding to amino acids 119-359 of hs-FGFRl were aligned 
after removing internal insertions in ce-FGFR (amino adds 459-465) 
and sp-FGFR (amino acids 441-444 and 455-481). 

FGFR tntracdhilar domain sequences (kinase subdomains n-VH; 
Hanks et aL 1988 corresponding to amino adds 492-663 of hs-FGFRl ) 
were ahgned after removing internal insertions in ce-FGFR (amino 
acids 747-750 and762-773), dm-FGFRa (anoino acids 519-522), and 
dm-FGFRb (amino adds 823-829). 

Phylogeny Inference. Phylogen^c trees were constrticted using the 
distance matrix (Dayboff PAM matrix) and ndghlxM'-joining algo- 
rithms of the Phylogeny InfCTence Padcage of J. Felsenstein (Felsen- 
stein 1989> Human Interleukin Ip (hs-ILlbX human interleukin 1 
preceptor (hs-lLlR), and human c-SRC (hs-SRC) sequences were 
used as outgroups in the constructions of trees for the FGFs and the 
FGFRs ecto- and kinase domains, respectively. 

Bootstrq;}ping is a resampling technique that allows one to calculate 
confidence limits on trees (Felsenstein 1985)l The bootstr^ value (in 
percentages) indicates the number of times a given branching occurs 
amcmg the bootstrapped samples and is a measure of the significance of 
a grouping with respect to the particular data set and to the method used 
for drawing the tree (Higgins et al. 1991), To test the validity (robust- 
ness) of branching, a total of 2,000 bootstrapped data sets were sub- 
jected to analysis, and a consensus tree was obtained using the Con- 
sense program (Felsenstein 1989). 



Results 



Identification of New FGF Genes FGFIO, FGFl I, 
and ce-FGF 

Database searches with known FGF sequences allowed 
us to identify FGF-related human cDNA clones 
(ym27b06, yr45d03, ym44el2 (Hillier et al. 1996), and 
MTO120 (Brody et al. 1995)), isolated as Expressed Se- 
quence Tag (EST), and a cosmid clone (C05D11) (Wil- 
son et al. 1994), isolated from a C eiegans genomic 
libraiy . Based on nucleic acid and amino add alignments 
of the partial sequences, we were able to identify the 
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'foblc 1. Amino acid identity score between FGF sequences 
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human clones as two new FGF sequences, tentatively 
caUed hs-FGFlO (ym27b06 and MTO120) and hs- 
FGFl 1 (yr45d03 and ym44el2). Further sequencing of 
these clones allowed us to derive the peptide sequences 
for the complete core region (Couher et al. 1991, 1994). 
hs-FGFlO- mi hS'FGF J I -mhiod genomic sequences 
^pear to exist in the mouse (data not shown). 

Z. Du and R. Waterston (ui^ublishecQ predicted the 
existence of a gene encoding a heparin-binding growth- 
factor-related peptide within cosmid C05D11 derived 
from chromosome HI of C. elegans. Alignment of this 
peptide sequence, tentatively called ce-FGF, with that of 
known FGFs, revealed 21-37% of amino acids identity 
with the other members of the femily. We have deter- 
mined that mRNA corresponding to this gene can be 
found in larval stages of C. elegans (data not shown). 



FGF Sequence Comparisons Show Conserved and 
Variable Stretches of Amino Acids 

The core amino acid sequence of 39 identified verte- 
brate FGFs (corresponding to 1 1 manunalian, four avian, 
one fish, and three amphibian FGF paralogs) and the C 
elegans FGF (ce-FGF) were aligned for comparison 
(Table 1; Fig. 1). The core sequence was obtained by 
deletion of N- and C-terminal extensions as well as spe- 
cific internal sequences. It represents about 120 amino 
acid residues in length. Twelve positions (10%) were 
found conserved in all sequences. The conservation 
among the other residues varies along the core sequence, 
with scattered clusters of highly conserved positions (for 
example, 79-89) and regions with limited similarity (for 
example, 97-114 or 138-146). 
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Fig. 1. Aligomeat of FGF sequences. Sequ^ces d^ved trran 40 known FGF proteins wgfe aligned, allowing tor gaps (-) in order to optimize 
the alignment. Positions of perfect identity are indicated (blade boxes), while open boxes contain amino acids that are identical or similar in at least 
eight orthologous sequ^ces. Amino acid numbering is according to human FGFl sequence. 



Eaiiier studies onFGFs have tried to define structuie- 
function relationships using mutants or protein fragments 
(J^e et al. 1986; Presta et al. 1992). The determination 
of the three-dimensional structure of FGFl and FGF2 
has brought clues about the impoitance of some amino 



acid residues (Eriksson et al. 1991; Zhang et al. 1991; 
Zhu et aL 1991; Faham et al, 1996). These include the 
role of the cysteine residues and the position of the hepa- 
ran-sulfate- and receptor-*inding sites. The two cys- 
teines consistently present in FGF sequences do not seem 
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Ikble 2. Amino acid identity score between FGFR sequences 
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to foim intramolecular bonds. Indeed, one of the cys- 
teines (position 98 in Fig. 1) is conserved in all FGFs, but 
the other (position 3 1) is present in most FGFs but not in 
FGFS, FGFll, and ce-FGF. The presumed receptor- 
binding site of FGF2 has been located in the region de- 
limited by residues 106-121. The differences in amino 
acid sequences within this stretch (which includes inser- 
tions of up to 16 residues in FGF3) could account for the 
discrimination between different receptors. Basic resi- 
dues K32, K127, R128, K133, K137, and K143 (as num- 
bered in Fig. 1), are thought to constitute a potential 
binding site for the sulfate group of heparin and other 
sulfated substrates (Eriksson et al. 1991; 23iang et aL 
1991; Zhu et al. 1991; Faham et al. 1996). Other residues 
possibly involved in saccharide binding include N33, 
N107, and Q142 (Faham et al. 1996). They are relatively 
well conserved in all FGF sequences except for K32, 
N33,K127, andK143. 



FGFR Sequence Comparisons 

The sequences of 25 FGF receptors identified in various 
species, including recently pubhshed FGFR sequence 



firom C elegans (DeVore et al. 1995), were aligned for 
comparison (Table 2 and Fig. 2). Comparisons were 
done separately to the extracellular and kinase domains 
but led for similar conclusions. In vertebrates, FGFRl, 
FGFR2, and FGFR3 orthologous sequences have more 
than 80% identity. FGFR4 sequeiK;es are sUghtly more 
different The four identified invertebrate sequences, in- 
cluding the two Drosophila FGFR, share an average of 
30% identity in the extracellular domain, and from 49 to 
67% in the kinase domain, with FGFRl to FGFR4 ver- 
tebrate sequences, and thus could not be identified as 
orthologs of any of them. 

The extracellular regions of the FGF receptors contain 
22 amino acid residues conserved throughout all se- 
quences (Fig. 2A). This includes cysteine residues in- 
volved in the tertiary structure of immunogjobulin-like 
domains. Some of these residues, such as C278, Y340, 
C342, and S347 of FGFR2, are targets of mutations 
found in human inherited skeletal disorders leading to 
receptor dysfimction (Muenke and Schell 1995; Mulvi- 
hill 1995; WilMe et al. 1995; Yamaguchi and Rossant 
1995). However, several mutations occur in residues 
which are voi conserved in invertebrates. One residue 
that is not conserved in invertebrate FGFRs is a proline 
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R§. 2. Aiigmnent of FGFR sequences. A Alignmait <^ extiacelhiiar 
domains. Sequences derived from the extraceUuJar domain of 21 
known FGF-Receptor proteins were ab'gned, allowing for gaps (-) in 
order to optimize the alignment Portions of sequences used for the 
alignment include the acidic box and the two C-terminal Ig loops. 
Positions of perfect identity are indicated {blade boxes). Open box 
indicates the position of the acidic domaiiL Amino acid numbering is 



according to human FGFRl sequenced Alignmait of kinase domains. 
SequCTces derived from the kinase domain of 25 known FGF-Receptor 
proteins wae aligned, allowing tor gaps (-) in order to optimize the 
alignment Portions of sequences used for the ahgnment include kinase 
subdomains II- VII (Hanks et al. 1988). Positions of perfect identity are 
indicated {black boxes). Amino acid numbering is according to human 
FGFRl sequence. 
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Fig. 2. Continued. 

(position 252 in FGFRl and 253 in FGFR2) (Fig. 2). 
Substitution of this proline by arginine in FGFRl and 
FGFR2 is found in two craniosynostosis syndromes, the 
Pfeiffer and Apert syndromes, respectively. Interest- 



ingly, an arginine residue is foimd at the corresponding 
position in the C elegans FGFR sequence. The point 
mutations found in human disorders are dominant and 
are presumed to activate constitutively the receptor. It 
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Fig. 3. Phylogenetic anaJysis of the FGF family. The 40 sequences 
aligned in Fig. 1, including mammalian, avian, batracian, and 
invertebrate FGF sequences, and human interleukin 1-p as an 
outgroiq) {hs-IL IV), were used to infer a phylogenetic tree. A 
Hiylogenetic tree where branch lengths are grossly proportional to 
calculated genetic distances, except for hs-IL lb. Only the bootstnq) 
values higher than 75% are indicated. B Phenogram representation 
of the inferred phylogenetic tree, and close-up view of the 
FGF4/FGF6 groi^. Branch lengths are arbitrary, Bootstr^ values 
are indicated for all the nodes that group together FGF paralogs. 
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would be interesting to test whether the counter mutation 
in C elegans (i.e., arginine to proline) leads to a change 
in ce-FGFR activity. 

Evolution of the FGF Family 

The 40 sequences ahgned in Fig. 1 were used to infer a 
phylogenetic tree (Fig. 3) by using the neighbor-joining 



algonthm (Felsenstein 1989). The sequence of human 
interieukin-ip, a peptide regulatory factor distantly re- 
lated to FGFs (Gimenez-Gallego et al. 1985; Eriksson et 
al, 1991; Zhang etal. 1991;Zhuetal. 1991), was used as 
an outgroup. Use of the parsimony algorithm led to a 
similar topology. 

All paralogs were grouped togqther, bootstr^ values 
exceeded 90%. 
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Fig. 4. Phylog^etic analysis of the FGFR family. The 25 kinase 
domains sequences aligned in Fig. 2B, inchicfing mammalian, avian, 
amphibian, fish, and invertetvate FGFR sequences, and human c-SRC 
as an outgroup (hs-SRC), were used to infer a phylog^etic tree. A 
Fhylogenetic tree where branch lengths are proportional to calculated- 
genetic distances, except for hs-SRC. A closer view of the vertebrate 
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FGFR subtree is shown on the right Only the bootstr^ values higher 
than 75% are indicated. B Phenogram representation of inferred phy- 
logenetic tree using either FGFR kinase or ectodomains. Branch 
lengths are arbitrary. Bootstr^ values are indicated for all the nodes 
that group together FGFR paralogs. 



Studying the biaiK^hing between paialogs showed that 
the only branchings wifli high confidence intervals were 
between FGFl and FGF2 (bootstrap value of 81%, see 
Materials and Methods section), FGF4 and FGF6 (98%), 
and FGFIO and FGFll (99%). None of the other branch- 
ings between paralogs could exceed a bootstrap value of 
31%. FGF1/FGF2, FGF4/FGF6, and FGFlO/FGFll are 
pairs of FCjFs that show the highest similarity score (T^le 
1), suggesting the corresponding paiis of genes duplicated 
more recently as conqiared to the otter FGF gqnes. 

Evolution of the FGF Receptor Family 

A similar analysis was conducted with FGF receptor se- 
quences (Fig. 4). Separate analyses of the extracellular 



region and intracellular kinase domain were performed 
usiqg both neighbor-joining and parsimoi^ algorithms, 
and yielded similar topologies (Fig. 4B). Comparison 
with the corresponding sequence alignments (Table 2) 
showed that changes in the kinase domain have occurred 
at a much slower rate. 

Invertebrate FGFRs from three ^)ecies appeared in 
separate branches which were all distinct from the ver- 
tebrate sequences (Fig. 4A). FGFR paralogs from verte- 
brate species were grouped together with high bootstrap 
values. The two FGFRs from D, melanogaster were dis- 
tinct fiom vertebrate FGFRs and branched together with 
a very high value of bootstr^ (99%), suggestive of a 
very recent duplicatioa This topology is in support of the 
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Fig. & Representation of a hypothetical FGF and FGFR evohition 
scheme. This evolutionary scheme is based on the available informa- 
tion on FGF and FGFR sequences in various species. Putative phases 
of gene duplications (shown on the left) are individualized, and are 
tentatively related toa phylogenetic tree of Metazoa (shown on the 
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right). FGF and FGFR e?q>ansion is shown to occur after the origin of 
nematodes and echinodenns, but the timing of this expansion will only 
be precisely fixed i^Km detennination of the number of FGF and FGFR 
sequences in these species. 



existence of one FGFR gene (or a lecenlly duplicated 
one like in D. melanogaster) in invertebrates and of an 
expansion to four members in vertebrates. 

Discussion 

This paper reports the presence of an FGF gene in an 
invert€i)rate species, the identification of two new human 
FGF, and the comparison of their deduced amino acid 
sequences with the known members of the FGF family. 
A similar analysis was done with FGFR sequences from 
invertebrate and vertebrate species. Phylogenetic trees 
inferred from the calculated genetic distances allow for 
hypotheses concerning the timing and functional signifi- 
cance of FGF and FGFR gene-famify expansion ai^ 
divergence. 

Hypotheses for the Origin of the Topologies 

Several l^potheses may e?q)lain the observed topologies. 
It is possible to individualize a few steps in a series of 
duplications firom a common ancestor at the origin of the 
present-day members of the FGF family. The most re- 
cent steps gave rise to FGF4 and FGF6 and to FGFIO 
and FGFIL FGFl and FGF2 may have been created at 
a similar period, or slightly before FGF4/FGF6 and 



FGFIO/FGFII divergeiKes. We call this step the phase 
of late duplications (Fig. 5). A more ancient series of 
duphcations, occurring over a limited time period, led to 
the emergence of eight members — ^namely, FGF3, 
FGF 5, FGF7, FGF8, FGF9, and the FGFl /FGF 2, 
FGF4/FGF6, and FGFIO/FGFU putative ancestors. 
The topology of the tree suggests that these eight mem- 
bers could have derived from a single gene; the emer- 
gence of several FGFs would have occurred during a 
phase of m^or genome expansion The topology of the 
FGFR tree suggests the existence of an ancestral FGFR 
gene and only one phase of expansion, leading to four 
FGFR members as identified in vertd)Tates. 

When did the FGF expansion occur with respect to the 
origin of the branch leading to nematodes? 

Tte e?q>ansion could have occurred before the sepa- 
ration between protostomia and deuterostomia, and to 
eight FGFs m^ be rq)resented in nematodes. If this is 
the case, it could mean that otter putative ce-FGF genes 
may exist and be related to a particular mammahan FGF. 
Interestingly, a number of genes fiom chromosome 17 in 
humans and chromosome III in C. eiegans (Ruddle et al. 
1994) are presumed to be homologs, suggesting that ce- 
FGF and hs-FGFJO might be paralogs. 
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FGF/FGFR Expansion May Be Associated with the 
Origin of Vertebrates 

Alternatively, the FGF expansion could have occuned 
after the protostomia/deuterostoada separation (Fig. 5). 
One can assume, firom the firaction of sequenced geiK)mic 
DNA and cDNAs from C. elegans, that about 30% of the 
total number of its genes are currently known (Wilson et 
al. 1994; Beiks and The C. elegans Cjenome Mapping 
and Sequencing Consortium 1995; Hodgkin et al. 1995). 
Assuming both a random distribution of FGF-encoding 
genes throughout the C. elegans genome and good efiB- 
ciency in the search strategies used here, this restricts the 
number of potential FGF genes in this spedes. It is thus 
imlikely that eight genes, orthologous to each FGF and 
to the FGF1/FGF2, FGF4/FGF6, and FGFlO/FGFll 
ancestors, are present in C. elegans, A likely l^^pothesis, 
sufBcient to explain the evolution of the FCff family, is 
that FGF expansion occurred after the separation of deu- 
terostomia and protostomia and was contemporaneous 
with a phase of global gene duplications that took place 
during the period leading to vertebrate emergence (Hol- 
land et al. 1994). Confirmation of this hypothesis will 
await identification of more FGF sequences in the inver- 
tebrates. 

The topology of the FGFR tree is strongly in support 
of this hypothesis if one assumes the two famihes have 
coevolved. The fuU complexity of the FGF receptor sys- 
tem already exists in anq)hibia (Thisse et aL 1995) and 
bony fish (Emori et al. 1992) as a probable coevohition 
with its ligand famify, but not in insects or echinoderms. 
Current failure to identify any FGF in Drosophila could 
be due to technical reasons, or, alternatively, be due to an 
evolutionary process which has resulted in the loss or 
absence of this type of gene in insects. Insects are known 
to have evolved as a separate branch of the metazoan tree 
distant from chordates (Fig. 5). The fact that FGF recep- 
tors were found in Drosophila (Klambt et al. 1992; 
Shishido et al. 1993) would appear to argue against the 
latter hypothesis but does not constitute definitive proof. 
In a similar naanner, tyrosine kinase neurotrophin recep- 
tors have been identified in the fly, but their activation 
occurs by way of homopl^lic interaction and is indepen- 
dent of hgands (Pulido et al. 1992). In aity case, the 
FGFRs characterized in Drosophila are different from 
the vertebrate FGFRs (Shishido et al. 1993). It is prob- 
able that the full complexity of the family, as it exists in 
mammals, is not developed in this species. Thus the low 
number or even complete absence of FGFs in insects is 
consistent with our l^^pothesis of FGF expansion asso- 
ciated with the origin of vertebrates. 

It is interesting to note that FGFs play important roles 
in the development of the skeletal system, as shown by 
the characterization of mutations in their receptors in 
inherited human diseases. Mutations of FGFRl, FGFR2, 
and FGFRS lead to disorders of the long bones and of the 
flat bones of the skull associated witii achondroplasia 



and craniosynostosis, respectively (Muenke and Schell 
1995; Wilkie et al. 1995), (It is interesting to note that 
FGER4 behaves slightly differently firom the other three 
receptors in several ways, including its topology in the 
tree and its noninvolvement in human inherited dis- 
eases.) Moreover, studies of limb bud growth and sclero- 
tome formation have demonstrated the inqx)rtant role of 
FGFs in this process (Tickle 1995; Tanaka and Gann 
1995; Grass et al. 1996). It is therefore tempting to 
speculate that e?q)ansion of the FGF/FGFR families is 
associated with the emergence of the vertebrate systems 
of motricity. 

Possible Role for the Late Duplication Events 

When did the phase of late diq)lications take place? Iden- 
tification of FGF4 and FGF6 or of their ancestor in 
fishes, ain)hibia, and birds could set the time for this 
step, providing they represent all pltyla, without loss 
through extinction 

The crucial role of FGF4 in limb bud development 
could suggest that the FGF4-FGF6 duplication event 
originated as a consequence of the generation of the mor- 
phological TOvelty that is the tetrapod limb during the 
fin-to-limb transition (Nelson and Tabin 1995), and that 
it took place after the origin of the fish lineage. This 
hypottesis suggests that FGF4 and FGF6 orthologs may 
actually exist in anq)hibia, but this is not firmly estab- 
lished. The identification of four FGF genes in fix)g and 
bird and the analysis of the pl^logenetic tree suggest that 
FGF genes orthologous to the mammalian FGFs exist in 
these species. Are there amphibian or avian genes or- 
thologous to each manunalian gene or to only eight of 
them? In other words, did the last step of duphcation, 
which created FGFl and FGF2, FGF4 and FGF6, and 
FGFIO and FGFl I occiff before or after the origin of 
amphibia and birds? Four FGF genes have been isolated 
in hog (Isaacs et aL 1992). Two of these sequences, 
designated xl-FGF4-I and xl'FGF4-II and corresponding 
to pseudo-alleles, are highly related to both FGF4 and 
FGF6 (75-76% and 74-75% amino acid identity with 
hs-FGF4 and lis-FGF6, respectively). A growth fector, 
related to both FGF4 and FGF6, has been characterized 
in chicken (Niswander et al. 1994). Like the abave- 
mon^iomd Xenopus FGF, gd-FGF4 seems only shghtly 
more related to FGF4 than to FGF6 (Table 1—80% and 
78% identity with hs-FGF4 and hs-FGF6, respectively). 
Thus FGF4 and FGF6 orthologs may not exist in birds. 
It is thus possible to speculate that the FGF4/FGF6 du- 
phcation may have occurred after the separation of the 
bird/reptile branch. While the inferred tree groups to- 
gether mammal and nonmammal FGF4 (Fig. 3AJB), it 
should be noted that this grouping has a very low boot- 
strap value (44%) and may not be significant Interest- 
ingly, FGF4 function is essential for early postimplanta- 
tion events in the mouse (Feldman et al. 1995), a role not 
relevant to birds or amphibia 
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Table 3. Chromosomal localization of genes of the FGF family in humans and the mouse 



Qiromosomal localization in: 



Gene 


Humans 


Mouse 
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3 A2— B 
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(Casey et aL 1986; Peters et aL 1984) 
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(Adelaide et aL 1988; Peters et aL 1989) 
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4q21 


5E1-F 


(Nguyen et aL 1988; Mattei et aL 1992) 
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(Maries et aL 1989; deLapeyriere et aL 1990) 
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(Kelley c/ 1992; Mattet et aL 1995a) 


FGF8 


10q25-q26 


19C3-D 


(White et al. 1995; Mattei et aL 1995a) 
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(Maltei c/ 1995b) 


FGFIO 









FGFll 



The presence of cMck FGF I and FGF2 genes indi- 
cates that the duplication of FGF1/FGF2 occurred be- 
fore the origin of the bird/reptile lineage. xl-FGFl se- 
quence is more related to hs-FGF2 (86%) than to 
hs-FGFl (55%), and the presence of FGFl protein has 
been reported in X, laevis (Shiuiba et aL 1991), The 
FGFI/FGF2 duplication may therefore have occurred 
even before the origin of amphibia The late duplication 
phase could thus be subdivided into two events (Fig. 5), 
the FGF1/FGF2 duplication having occiured before the 
origin of amphibia and the FGF4/FGF6 duplication as 
late as after the separation between birds aBd mammals. 

As only the human FGFIO and FGFll genes have 
been identified, there are no data upon which to speculate 
as to the timing of the FGFlQ/FGFll duplicatioa The 
failure to have detected either of these closely related 
FGFs prior to the search of sequence databases may be 
more than coincidental aiKl reflect somewhat different 
functions of these two new FGFs, 

In humans and in the mouse, two species in which 
chromosomal localization of the FGF genes has been 
determined, FGFs are located on different chromosomes. 
As an exception, FGF3 and FGF4 are tandemly linked 
on chromosomal band llql3 in humans and on chromo- 
some 7 in the mouse (Table 3). It could be noted also that 
FGF4 and FGF6 on chromosomes 1 1 arMi 12 on the one 
hand, and FGFl and FGF2 on chromosomes 4 and 5 on 
the other hand, are, respectively, on pairs of chromo- 
somes that appear to contain paralogous genes and are 
thought to derive from each otter (Lundin 1993). The 
idendfication of remnants of genome evolution may be 
explained by late events of diq)licatioa These events 
could be related to the FGF late diq)lication phase. 

An Integrated View ofFGF/FGFR Function 
and Evolution 

The presence of an FGF gene in C. elegans allows for 
the hypothesis of a scheme of evolution before and after 
the protostomia/deuterostomia separatioa 



The biological role of an FGF protein in nematodes is 
open to investigatioa It could be involved in mesoderm 
induction and mesoderm development or positioning. 
Studies of mutant FGFR gene in C. elegans suggest FGF/ 
FGFR could be associated with development orpositioning 
of the muscle system (EteVore et al. 1995). It is teiiq)tir^ to 
speculate that FGFs are contemporary of tr5)loblast phyla 
and m^ not exist in diploblast species (Fig. 5). 

Among various possible roles during embryogenesis, 
recent studies have shown that FGFs are involved in 
migration and patterning during the formation of the 
skeletal system of mammals. Failure to infer a robust 
phylogenetic tree fiom available data suggests the FGF 
genes have evolved considerably since their separation 
from a conunon ancestor and may be explained by a 
series of duplications occurring early in evolutioa This 
expansion in the number of FGF genes may have been 
an important determinant of skeletal system formation in 
vertebrates. A second and late phase of FGF duphcations 
may be related to the establishment of improved signal- 
ing networks — also involving BMP, WNT, Hedgehog, 
and HOX family members — responsible for the fin-to- 
Umb transition or to the spht between ray- and lobe- 
finned borq^ fish This scheme is reminiscent of the pre- 
sumed evolution of the homeobox-containing HOX 
genes, during which the an:q)lification of a single cluster 
was coincidental with the transition fiom invertebrate chor- 
dates to vertebrates and was followed by the acquisition of 
extra HOXgrnes in relation to the appearance of vertebrate 
heal and limb. The FGF ard HOX gene famihes could 
have coevolved, leading ultimately to a higher conq)lexity. 
More generally, the expansion of famihes of genes is pre- 
sumed to have coincided with metazoan radiatioa Further 
identification of FGF arHl FGFR genes in otter organisms 
and cortq)arison of their protein sequences are necessary to 
help confirm or refute the above hypothesis. 

Note Added in Proof 

While this manuscript was in press, Itoh and co-workers 
reported the isolation of a new FGF gene in rat, which 
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they called FgflO [Yamasaki et al., J. Biol Chem (1966) 
271.15918-15921]. The two new human FGF genes de- 
scribed in our paper should therefore be designated 
FGFll and FGF12, FGF homologous factor (FHF)-1 to 
-4 genes have also been reported recently [Smallwood et 
al., Proc Natl Acad Sci USA (1996) 93:9850-9857]. 
FHF'3 and FHF'I correspond to FGFU and FGF 1 2 
described here. We propose FHF-2 and FHF-4 be des- 
ignated FGF] 3 and FGFI4, in agreement with the rec- 
ommendation of the Nomenclature Committee [Baird et 
al., Ann NY Acad Sci (1991) 638:xiii-xxvi]. 
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Summary 



Fibroblast growth factors (FGFs) make up a lai^e family of polypeptide growth factors that are found 
in organisms ranging from nematodes to humans, in vertebrates, the 22 members of the FGF family 
range in molecular mass from 1 7 to 34 kDa and share 1 3-71% amino acid identity. Between vertebrate 
species, FGFs are highly conserved in both gene structure and amino-acid sequence. FGFs have a high 
affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell- 
surface FGF receptors. During embryonic development, FGFs have diverse roles in regulating cell 
proliferation, migration and differentiation. In the adult organism, FGFs are homeostatic factors and 
function in tissue repair and response to Injury. When inappropriately expressed, some FGFs can 
contribute to the pathogenesis of cancer. A subset of the FGF family, expressed in adult tissue, is 
important for neuronal signal transduction in the central and peripheral nervous systems. 



Gene organization and evolutionary history 
Gene organization 

The prototypical Fgf genes contain three coding exons 
(Figure 1), with axon 1 containing the initiation methionine, 
but several Fgf genes (for example, FQf2 and Fgfs) have 
additional 5' transcribed sequence that initiates from 
upstream CUG codons [1,2]. The size of the coding portion of 
Fgf genes ranges from under 5 kb (in Fgf 3 and Fgf4) to over 
100 kb (in Fgfi2\ In several Fgfsubfamilies, exon 1 is subdi- 
vided into between two and four alternatively spliced sub- 
exons (denoted lA-iD in the case of Fgf8). In these Fgf 
genes, a single initiation codon (ATG) in exon lA is used. 
This gene organization is conserved in humans, mouse and 
zebrafish, but its functional consequences are poorly under- 
stood. Other subfamilies of Fgfs (such as Fgf 11-14) have 
alternative amino termini, which result from the use of alter- 
native 5' exons. It is not known whether a common 5' 



untranslated exon splices to these exons or whether alterna- 
tive promoter and regulatory sequences are used. , ; 

Most Fgf genes are found scattered throughout , the genome. 
In human, 22 FGF genes have been identified and the chro- 
mosomal locations of all except FGF16 are known (Table 1) 
[3-7]. Several human FGF genes are clustered within the 
genome. FGF3, FGF4 and FGF19 are located on chromosome 
iiqi3 and are separated by only 40 and 10 kb, respectively; 
FGFS and FGF23 are located within 55 kb on chromosome, 
I2pi3; and FGFi^ and FGF20 map to chromosome 8p2i-p22. 
These gene locations indicate that the FGF gene family was 
generated both by gene and chromosomal duplication and 
translocation during evolution. Interestingly, a transcription- 
ally active portion of human FGF7, located on chromosome 
I5qi3-q22, has been amplified to about 16 copies, which are 
dispersed throughout the human genome [8]. 
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Figure I 

Gene structure of selected members of the Fgf family. Only 
the portion of each gene containing coding exons is shown. 
Constitutively expressed exons are in black; alternatfvefy 
spliced exons are in gray. Fgfs I, 2, 4 and 9 contain the proto- 
typic three-exon or^ganization. For Fgfl, 5* untranslated exons 
are not shown; inclusion of these exons extends the gene by 
approximately 69 kb [78]. Fgf8 is an example of a gene with 5' 
alternative splicing, and Fgfl 3 demonstrates alternatively used 
5' exons separated by over 30 kb. References: Fgfl [781; Fgf2 
[79]; Fgf4 [80]; Fgf8 [52]; Fgf9 [81]; FgftS [76]. 



In the mouse, there are at least 22 FgJ genes [3,9], and the 
locations of 16 have been identified (Table 1). Many of the 
mouse Fgf genes are scattered throughout the genome, but 
as in the human, Fgfs, Fgf4 and Fgfig are closely linked 
(within 80 kb on chromosome 7F) and Fgf6 and are 
closely linked on chromosome 6F3-G1. 

Evolutionary history 

Fgfs have been identified in both invertebrates and verte- 
brates [3]. Interestingly, an Fgf -like gene is also encoded in 
the nuclear polyhedrosis virus genome [10]. FgJ-\\ke 
sequences have not been found in unicellular organisms 
such as Escherichia coli and Saccharomyces cerevisiae. 
Although the Drosophila and Caenorhabditis elegans 
genomes have been sequenced, only one FgJ gene {branch- 
less) has been identified in Drosophila [11] and two (egl-17 
and let-^56) have been identified in C. elegans [12,13], in 
contrast to the large number of FgJ genes identified in verte- 
brates. The evolutionary relationship between invertebrate 
and vertebrate Fgfs is shovm in Figure 2a. 



The FgJ gene expansion has been hypothesized to be coinci- 
dent with a phase of global gene duplications that took place 
during the period leading to the emergence of vertebrates 
[14]. Across species, most orthologous FGF proteins are 
highly conserved and share greater than 90% amino-acid 
sequence identity (except human FGF15 and mouse Fgfi9; 
see below). To date, four Fgfs {Fgfs, S, 17 and 18) have been 
identified in zebrafish, seven {Fgfs, Fgf(i), Fgf(ii), FgfS, 9 
and 20) in Xenopus {Fgf(i) and Fgf[ii) are most closely 
related to Fgf4 and Fgf6 [15]) and seven {Fgfs, 4, 8, 12, 14, 
18 and 19) in chicken [3]. 

The apparent evolutionary relationships of the 22 known 
human FGFs are shown in Figure 2b. Vertebrate FGFs can be 
classified into several subgroups or subfamilies. Members of 
a subgroup of FGFs share increased sequence similarity and 
biochemical and developmental properties. For example, 
members of the FGFS subfamily (FGFS, FGF17, and FGF18) 
have 70-80% amino acid sequence identity, similar receptor- 
binding properties and some overlapping sites of expression 
(for example, the midbrain-hindbrain junction) [16,17]. 
Members of FGF subgroups are not closely linked in the 
genome, however, indicating that the subfamilies were gener- 
ated by gene-translocation or by genome-duplication events, 
not by local duplication events. 

Human FGF15 and mouse Fgfig have not been identified. 
Human FGF19 is evolutionarily most closely related to 
mouse Fgfis (51% amino acid identity; Figure 2b) [18] and 
both the human FGF19 and mouse Fgf 15 genes are closely 
linked to the human and mouse Fgfs and Fgf4 genes on 
orthologous regions of human chromosome iiqi3 and 
mouse chromosome 7F (N.I., unpublished observations). 
These findings indicate that human FGFig may be the 
human ortholog of mouse Fgfis, Because all other FgJ 
orthologs share greater than 90% amino acid identity, it 
remains possible that the true orthologs of these genes have 
not been identified, have been lost or have diverged during 
vertebrate evolution. 



Characteristic structural features 

FGFs range in molecular weight from 17 to 34 kDa in verte- 
brates, whereas the Drosophila FGF is 84 kDa. Most FGFs 
share an internal core region of similarity, with 28 highly 
conserved and sbc identical amino-acid residues [19]. Ten of 
these highly conserved residues interact with the FGF recep- 
tor (FGFR) [20]. Structural studies on FGFi and FGF2 iden- 
tify 12 antiparallel p strands in the conserved core region of 
the protein (Figure 3) [21,22]. FGFi and FGF2 have a p trefoil 
structure that contains four-stranded p sheets arranged in a 
triangular array (Figure 3b; reviewed in [23]). Two p strands 
(strands pio and pii) contain several basic amino-acid 
residues that form the primaiy heparin-binding site on FGF2. 
Regions thought to be involved in receptor binding are dis- 
tinct from regions that bind heparin (Figure 3) [21-24]. 
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Chromosomal localizations of FGFs in human and mouse 


Human Mouse 
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Human Mouse 



FGft 

FCF2 

FGF3 

FGF4 

FGFS 

FGF6 

FGF7 

FGFS 

FGF9 

FGFtO 

FGFil 
(FHF3) 

FGFI2 
(FHFt) 

FGFI3 
(FHF2) 

FGFI4 
(FHF4) 

FGF\6 
FGFtJ 
FGFIB 
FGFI9* 
.FGF20 
FGF2I 
FGF22 
FGF23 



5q31 

4q26-27 

Ilql3 
tlql3.3 
4q2t 
I2pl3 
I5ql5-2I.I 

I0q24 
I3qll-ql2 
5pl2-pl3 
I7pl3.! 

3q28 

Xq26 

I3q34 



8p2t 
5q34 
Ilql3.l 
8p2l.3-p22 
I9ql3,l-qter 
!9pl3.3 
I2pl3.3 



Fgf\ 
Fzfl 

FgfS 
Fgfh 
FgfT 

FgP 
fgflO 
FgfW 

fgfI4 

Fgf]S 
^7 
fgflB 

FgflO 
Fgfl\ 
Fgfl2 

Fsa^ 



18 [82.83] 

[84.85] 

[86-88] 
[87.89] 
[85.90] 
[91,92] 
[93.94] 
[54.95] 
[81.96.97] 
[98.99] 
[100] 

[31.100-102] 

X [31.76.103] 

14 [31] 

7F (N.L. unpublished obserrations) 

14 [104] 
[105] 
[106] 
[27.107] 
[108] 
[109] 

6F3-G I [7.75] (N.I.. unpublished) 



X6S778. E03692. 
E04557 



U676I0, M3064I 



3A2-B 

7F 
7F 
5EI-F 
6F3-GI 
2F-G 
I9C3-D 

I4D 
I3A3-A4 



I6BI-B3 



E05628. M27968 


M30644. AF065903, 




AF065904. AF065905 


XI 4445 


Y00848 


E03343 


M30642 


M37825 


M30643 


X63454 


M924I6 . 


M60828 


Z22703 


U36223. U56978 


Z48746 


D 14838 


U33535. D38258 


AB002097 


. D89080 


U66f99 


U66203 


U66I97 


U6620I 


U66I98 


U66202, AR)20737 

c 


U66200 


U66204 



AB00939I 

AB009249 
AB007422. AF075292 
AB0I8I22. AFI 10400 
AB030648. AB044277 

AB02I975 

AB02I925 
AB037973. AF263537 



AF007268 
AB0492I9 
AB009250 
AB004639. AF07529I 

. AB0492I8 
• AB0257I8 
AB036765 
AB037889.AF263536 



*Human fGF/9 and mouse FgfIS may be orthologous genes. 



Localization and function 
Localization 

SubceMar ^ocalization and secretion 

Most FGFs (FGFs 3-8, 10, 15, 17*19, and 21-23) have amino- 
terminal signal peptides and are readily secreted from cells. 
FGFs 9, 16 and 20 lack an obvious amino-terminal signal 
peptide but are nevertheless secreted [25-27]. FGFi and FGF2 
also lack signal sequences, but, unhke FGF9, are not secreted; 
they can, however, be found on the cell surface and within the 
extracellular matrix. FGFi and FGF2 may be released from 
damaged cells or could be released by an exocytotic mecha- 
nism that is independent of the endoplasmic-reticulum-Golgi 



pathway [28]. FGF9 has been shown to contain a non-cleaved 
amino-terminal hydrophobic sequence that is required for 
secretion [29,30]. A third subset of FGFs (FGFii-14) lack 
signal sequences and are thought to remain intracellular 
[31-34]. It is not known whether these FGFs interact with 
known FGFRs or function in a receptor-independent manner 
within the cell. FGF2 and FGF3 have high-molecular-weight 
forms that arise from initiation from upstream CUG codons 
[2,14,35]. The additional amino-terminal sequence in these 
proteins contains nuclear-localization signals, and the pro- 
teins can be found in the nucleus; the biological function of 
nudear-locahzed FGF is unclear. 
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Figure 2 

Evolutionary relationships within the FGF family, (a) Apparent evolutionary relationships between FGFs from vertebrates, 
invertebrates and a virus. Amino-acid sequences of nine representative FGFs were chosen from human and compared with 
FGFs from Drosophila, C elegans, zebrafish and Autographa californica nuclear polyhedrosis virus, (b) Apparent evolutionary 
relationships of the 22 known human and murine FGFs. Sequences were aligned using Genetyx sequence analysis software 
and trees were constructed from the alignments using the neighbor-joining method. 



Developmental expression patterns and fitnction 
The 22 members of the mammalian FGF family are differen- 
tially expressed in many, if not all, tissues, but the patterns 
and timing of expression vary. Subfamilies of FGFs tend to 
have similar patterns of expression, although each FGF also 
appears to have unique sites of expression. Some FGFs are 
expressed exclusively during embryonic development (for 
example, Fgfs, 4, 8, 15, 17 and 29), whereas others are 
expressed in embryonic and adult tissues (for example, Fgfi, 
2, 5-7, i6, iB, and 20-23). 

Function 

The expression patterns of FGFs (see above) suggest that they 
have important roles in development. FGFs often signal 



directionally and reciprocally across epithelial-mesenchymal 
boundaries [36]. The integrity of these signaling pathways 
requires extremely tight regulation of FGF activity and 
receptor specificity. For example, in vertebrate limb develop- 
ment, mesenchymally expressed Fgf 10 in the lateral-plate 
mesoderm induces the formation of the overlying apical ecto- 
dermal ridge; the ridge subsequentiy expresses FgfB, which 
signals back to the underlying mesoderm [37]. This direc- 
tional signaling initiates feedback loops and, along with other 
signaling molecules, regulates the outgrowth and patterning 
of the limb. Importantiy, the differential expression of the 
alternative splice forms of the receptors in the apical ectoder- 
mal ridge and underlying mesoderm is such as to limit or 
prevent autocrine signaling within a given compartment. 
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Figure 3 

(a) Strurtural features of the FGF polypeptide. The amino 
terminus of some FGFs contains a signal sequence (shaded). 
All FGFs contain a core region that contains conserved 
amino-acid residues and conserved structural motifs. The 
locations of p strands within the core region are numbered 
and shown as black boxes. The heparin-binding region (pink) 
includes residues in the loop between p strands I and 2 and 
in p strands 10 and 1 1 . Residues that contact the FGFR are 
shown in green (the region contacting Ig-domain 2 of the 
receptor), blue (contacting Ig-domain 3) and red (contacting 
the alternatively spliced region of Ig-domain 3). Amino-acid 
residues that contact the linker region are shown in gray 
[20]. (b) Three-dimensional structure of FGF2. a 
prototypical member of the FGF family. A ribbon diagram of 
FGF2 Is shown; p strands are labeled i-12 and regions of 
contact with the FGFR and heparin are color-coded as in (a) 
[22.24]. Image provided by M. Mohammadi. 



Studies of the biochemical activities of FGFs have focused on 
the specificity of interactions between FGFs and FGFRs, on 
factors that affect the stability of FGFs and on the composition 
and mechanism of the active FGF-FGFR signaling complex. 

Spedfidty of FGFs for FGF receptors 

The FGFR tyrosine kinase receptors contain two or three 
immunogjobulin-like domains and a heparin-binding 
sequence [38-40]. Alternative mRNA splicing of the FGFR 
gene specifies the sequence of the carboxy-terminal half of 
immunoglobulin-domain III, resulting in either the Illb or 
the IIIc isoform of the FGFR [41-43]- This alternative-splic- 
ing event is regulated in a tissue-specific manner and dra- 
matically affects ligand-receptor binding specificity [44-48]- 
Exon Illb is expressed in epithelial lineages and exon IIIc 
tends to be expressed in mesenchymal Hneages [44,46-48]. 
In vitro patterns of binding specificity have been determined 



for each spHce form of FGFR1-3 and for FGFR4, which is not 
alternatively spliced [49-51]. Ligands specific for these 
receptor splice forms are expressed in adjacent tissues, 
resulting in directional epithelial-mesenchymal signaling. 
For example, epithelially expressed FGFR2b (that is, FGFR2 
Illb isoform) can be activated by FGF7 and FGFio, ligands 
produced in mesenchymal tissue [49-51]- These ligands 
show no activity towards mesenchymally expressed FGFR2C. 
Conversely, FGFS is expressed in epithelial tissue and acti- 
vates FGFR2C but shows no activity towards FGFR2b 
([49,52] and our unpublished observations). Notably, FGF8 
expression is often restricted to epithelial tissue such as the 
apical ectodermal ridge of the developing limb bud [53»54]- 

Interactjon with heparir) or heparar) sulfate proteoglycans 
An important feature of FGF biology involves the interaction 
between FGF and heparin or heparan sulfate (HS) proteogly- 
can (HSPG) [19]. These interactions stabilize FGFs to 
thermal denaturation and proteolysis and may severely limit 
their diffusion and release into interstitial spaces [55>56]. 
FGFs must saturate nearby HS-binding sites before exerting 
an effect on tissue further away, or else must be mobilized by 
heparin/HS-degrading enzymes. The interaction between 
FGFs and HS results in the formation of dimers and higher- 
order oligomers [57*59]. Although the biologically active 
form of FGF is poorly defined, it has been established that 
heparin is required for FGF to effectively activate the FGFR 
in cells that are deficient in or unable to synthesize HSPG or 
in cells pretreated with heparin/HS-degrading enzymes or 
inhibitors of sulfation [60-62]. Genetic studies have also 
shown that mutations in enzymes involved in HS biosynthe- 
sis affect FGF signaling pathways during development 
[19,63]. Additional studies have shown that heparin and/or 
HS act to increase the affinity and half-life of the FGF-FGFR 
complex (reviewed in [40,64]). 

A minimal complex containing one FGF molecule per FGFR 
can form in the absence of HS [24]. Structural studies 
suggest that HS may bridge FGF2 and the FGFR by binding 
to a groove formed by the heparan-binding sites of both the 
ligand and the receptor [24,65]. Binding studies with soluble 
chimeric FGFRs have identified a second potential FGF- 
binding site that, in some cases, can interact cooperatively 
with the primary FGF-binding site [66]. 

important mutants 

Many members of the Fgf family have been disrupted by 
homologous recombination in mice. The phenotypes range 
from very early embryonic lethality to subtle phenotypes in 
adult mice. The major phenotypes observed in Fqf knockout 
mice are shown in Table 2. Because FGFs within a subfamily 
have similar receptor-binding properties and overlapping 
patterns of expression, functional redundancy is likely to 
occur. This has been demonstrated for Fgfi7 and Fgf8, which 
cooperate to regulate neuroepithelial proliferation in the mid- 
brain-hindbrain junction [17]. In the case of FgJ knockouts 
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resulting in early lethality, other functions later in develop- 
ment will need to be addressed by constructing conditional 
alleles that can be targeted at specific times and places in 
development. For example, Fgf8 mice die by embryonic 
day 9.5 [67]. A conditional allele for FgfS targeted to the 
apical ectodermal ridge has been used to demonstrate an 
essential role for FgfS in early limb development [68,69]. 

Several mutations in Fgf genes have been identified in 
C. elegans, Drosophila, zebrafish, mouse and human. The 
C. elegans gene egl-ip is required for sex myoblast migration 
[12], and a null allele of let-756 causes developmental arrest 
of the early larva [13]. The Drosophila branchless gene is 
required for tracheal branching and cell migration [11]. In 
zebrafish, acerebellar (ace) embryos lack the cerebellum 
and the midbrain-hindbrain boundary organizer. The ace 
gene encodes the zebrafish homolog of FgfS [70]. Interest- 
ingly, zebrafish aussicht mutant embryos, which overexpress 
FgfS, also have defects in development of the central 
nervous system [71]. 

In the mouse, the angora mutation, which affects hair 
growth, was found to be allelic with Fgfs [72]. A mouse 
mutant with a Crouzon-syndrome-like craniofacial dysmor- 
phology phenotype was fotmd to result from an insertional 

Table 2 



FGF knockout mice 



Gene 


Survival of null mutant* 


Phenotype 


References 


fgfl 


Viable 


None identified 


[110] 




Viable 


Mild cardiovascular, skeletal, neuronal 


[110-114] 


Ffif3 


Viable 


Mild inner ear, skeletal (tail) 


[115] 




Lethal, E4-5 


Inner ceil mass proliferation 


[116] 


FgfS 


Viable 


Long hair, angora mutation 


[72] 




Viable 


Subtle, muscle regeneration 


[117-119] 


fgf7 


Viable 


Hair follicle grovnh, ureteric bud growth 


[120.121] 




Lethal, E7 


Gastruladon defect, CNS development, limb development 


[67,70,122.123] 




Lethal. PO 


Lung mesenchyme, XY sex reversal 


[l24];(J.S.Colvineto/., 
personal communication) 




Lethal. PO 


Development of multiple organs, including limb, lung, 
thymus, pituitary 


[125-127] 


(Fhfl) 


Viable 


Neuromuscular phenotype 


(J. Schoorlemmer and M. Goldfarb, 
personal communication) 


(Fbf4) 


Viable 


Neurological phenotypes 


(Q. Wang, personal communication) 


FgfIS 


Lethal, E9.5 


Not clear 


(J.R. McWhirter, personal communication) 




Viable 


Cerebellar development 


[17] 


Fgft8 


Lethal. PO 


Skeletal development 


(N. Ohbayashi, Z. Uu. 
personal communication) 



*E. embryonic day; P, postnatal day. 



mutation in the Fgf3/Fgf4 locus [73]. Recently, positional 
cloning of the autosomal dominant hypophosphataemic 
rickets gene identified missense mutations in human 
FGF23 [74]. A recent paper demonstrates that this disease 
is caused by a gain-of-function mutation [75]. The chromo- 
somal location (Xq26) and tissue-specific expression 
pattern of Fgfis (also called Fhf2) suggests that it may be a 
candidate gene for Borjeson-Forssman-Lehmann syn- 
drome, an X-linked mental retardation syndrome [76]. 

Frontiers 

Issues most studied 

FGFs have been intensely studied for nearly 30 years. Most 
of the early work focused on the mechanisms that regulate 
stability, secretion, export and interactions with heparin and 
on the mechanisms and consequences of signal transduction 
in various types of cells. More recent work has focused on 
the mechanisms regulating receptor specificity and receptor 
activation, the structure of the FGF-FGFR-HS complex, and. 
the identification of new members of the FGF family. Func- 
tional studies have begun to address the role of FGFs in cell 
biology, development and physiology. Initial studies focused 
on the regulation of cell proliferation, migration and differ- 
entiation; more recent work has addressed the negative 
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effect of FGFs and FGFRs on proliferation of some cell types, 
which was surprising as FGFs were thought to promote pro- 
liferation. In vitro studies have now been complemented by 
gene targeting in mice. The knockout approach has been 
fairly successful in identifying primary phenotypes but will 
be challenged by the need to address redundancy amongst 
the 22 FGFs and to study their developmental and physio- 
logical functions after the point of lethality of the null allele. 

Unresolved questions 

A major unresolved question concerns the mechanisra(s) reg- 
ulating FGF activity in uivo in the presence of cell-surface and 
extracellular-matrix HSPG. Current hypotheses predict that 
tissue-specific heparan fragments of defined sequence (and 
particularly of defined sulfation pattern) will differentially 
regulate FGFs by controlling their diffusion in the extracellu- 
lar matrix and their ability to activate specific receptors [77]. 
These issues will be resolved by determining the sequence of 
tissue-specific HS and by demonstrating whether specific HS 
sequences can modulate the binding specificity of FGFs 
beyond that determined by the specific FGFR and its alterna- 
tive splice form in the presence of heparin. 

A second area of research will aim to elucidate the develop- 
mental roles of all the FGFs, first alone and then in various 
combinations. This will include determining whether a single 
FGF with a defined developmental function interacts with one 
or multiple FGFRs. A third major fix)ntier will be to elucidate 
the physiological roles of FGFs that are expressed in adult 
tissues. This will again involve testing combinations of FGFs 
in cases in which knockouts are viable and designing condi- 
tional alleles in cases of embryonic lethality. Major areas being 
considered include neuronal and cardiovascular physiology, 
neuronal regeneration and homeostasis and tissue repair. 

The last major frontier will be to elucidate the primary roles 
of FGFs in genetic diseases and cancer. Several FGFs were 
initially cloned fi-om human and animal tumors. Future 
work will be required to determine whether FGF activation 
is itself an etiological agent in primary human tumors or 
whether it is a progression factor in the pathogenesis of 
cancer. As functional roles for FGFs are elucidated in embry- 
onic development, it is expected that various human birth 
defects and genetic diseases will be attributed to mutations 
in Fgf genes. These studies will probably lead to the develop- 
ment of pharmacogenetic agents to treat these diseases. 
Because a large number of skeletal diseases are caused by 
mutations in Fgfr genes, it is anticipated that mutations in 
some Fgf genes will also be involved in skeletal pathology. 
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Identification of isofomns of Fhf4/Fgfl 4 that result from the alternative 
usage of two different first exons, Isofomns show different subcellular 
localization and distinct expression patterns in developing and adult 
mouse tissues. Expression was observed in migrating and post migra- 
tory neurons. 
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hypophosphatemic osteon^alacia tumors. 



76. Gecz j. Baker E, Donnelly A, Ming JE, McDonald-McGinn DM, Spinner 
NB. Zackai EH, Sutherland GR, Mulley JC: Fibroblast growth 
factor homologous factor 2 (FHF2): gene structure, expres- 
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87. Yoshida CM, Wada M, Satoh H, Yoshida T, Sakamoto H, Miyagawa 
K, Yokota J, Koda T, Kakinuma M, Sugimura T, et o/.: Human HSTI 
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oncogenesis in mouse mammary epithelial cells. 
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